Overview

Dataset statistics

Number of variables: 11
Number of observations: 800
Missing cells: 387
Missing cells (%): 4.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 197.0 KiB
Average record size in memory: 252.1 B
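
The percentages in the table above follow directly from the counts. The following is a minimal sketch in plain Python, not pandas-profiling's own implementation. It assumes missing cells are represented as `None` and that a duplicate row is any repeat of an earlier row; the helper name `overview` and the three trimmed sample rows are illustrative only.

```python
def overview(rows):
    """Summarise a list of equal-length tuples; None marks a missing cell."""
    n_obs = len(rows)
    n_vars = len(rows[0]) if rows else 0
    missing = sum(cell is None for row in rows for cell in row)
    total_cells = n_obs * n_vars
    # Assumption: a duplicate is any row that repeats an earlier row.
    duplicates = n_obs - len(set(rows))
    return {
        "variables": n_vars,
        "observations": n_obs,
        "missing_cells": missing,
        "missing_cells_pct": 100 * missing / total_cells if total_cells else 0.0,
        "duplicate_rows": duplicates,
    }


# Three records trimmed to (Name, Type 1, Type 2) for illustration.
rows = [
    ("Bulbasaur", "Grass", "Poison"),
    ("Charmander", "Fire", None),
    ("Squirtle", "Water", None),
]
stats = overview(rows)
print(stats)

# The same arithmetic applied to the reported counts:
# 387 missing cells out of 11 variables x 800 observations.
report_missing_pct = 100 * 387 / (11 * 800)
print(round(report_missing_pct, 1))  # 4.4
```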

Variable types

NUM: 7
CAT: 3
BOOL: 1

Reproduction

Analysis started: 2020-08-10 12:59:53.648800
Analysis finished: 2020-08-10 13:00:22.781228
Duration: 29.13 seconds
Version: pandas-profiling v2.8.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

Name has a high cardinality: 799 distinct values [High cardinality]
Type 2 has 386 (48.3%) missing values [Missing]
Name is uniformly distributed [Uniform]

Variables

Name
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count: 799
Unique (%): 100.0%
Missing: 1
Missing (%): 0.1%
Memory size: 6.4 KiB

Value                    Count  Frequency (%)
Skorupi                  1      0.1%
Wurmple                  1      0.1%
Oddish                   1      0.1%
Mismagius                1      0.1%
Wormadam Sandy Cloak     1      0.1%
Monferno                 1      0.1%
Kakuna                   1      0.1%
Mega Tyranitar           1      0.1%
Rapidash                 1      0.1%
Jirachi                  1      0.1%
Thundurus Therian Forme  1      0.1%
Exploud                  1      0.1%
Exeggcute                1      0.1%
Turtwig                  1      0.1%
Porygon2                 1      0.1%
Mega Aggron              1      0.1%
Buizel                   1      0.1%
Nidoran♀                 1      0.1%
Sharpedo                 1      0.1%
Luxray                   1      0.1%
Pumpkaboo Large Size     1      0.1%
Cinccino                 1      0.1%
Skrelp                   1      0.1%
Geodude                  1      0.1%
Ursaring                 1      0.1%
Other values (774)       774    96.8%

Length

Max length: 25
Median length: 8
Mean length: 8.36875
Min length: 3

Overview of Unicode Properties

Unique unicode characters: 60
Unique unicode categories: 7
Unique unicode scripts: 2
Unique unicode blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
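
As an illustration of how such per-character properties can be tallied, the sketch below uses Python's standard `unicodedata` module to count general categories across a few names taken from this report. The helper name `category_counts` and the small `CATEGORY_NAMES` mapping are assumptions made for this example; the report's own implementation may differ.

```python
import unicodedata
from collections import Counter

# Long names for the two-letter general-category codes seen in this report.
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "So": "Other Symbol",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
}


def category_counts(values):
    """Count Unicode general categories over every character in `values`."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


names = ["Nidoran♀", "Porygon2", "Wormadam Sandy Cloak"]
counts = category_counts(names)
for code, count in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), count)
```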

Most occurring characters

ValueCountFrequency (%) 
a6339.5%
 
e6099.1%
 
o5287.9%
 
r4787.1%
 
i4446.6%
 
n3625.4%
 
l3545.3%
 
t2974.4%
 
u2393.6%
 
s2053.1%
 
g1983.0%
 
m1682.5%
 
d1582.4%
 
h1412.1%
 
c1372.0%
 
1362.0%
 
S1291.9%
 
p1291.9%
 
M1201.8%
 
y1101.6%
 
k1031.5%
 
b851.3%
 
w691.0%
 
C620.9%
 
G580.9%
 
Other values (35)74311.1%
 

Most occurring categories

Value              Count  Frequency (%)
Lowercase Letter   5614   83.9%
Uppercase Letter   937    14.0%
Space Separator    136    2.0%
Other Punctuation  3      < 0.1%
Other Symbol       2      < 0.1%
Dash Punctuation   2      < 0.1%
Decimal Number     1      < 0.1%

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
S12913.8%
 
M12012.8%
 
C626.6%
 
G586.2%
 
P566.0%
 
F505.3%
 
A475.0%
 
D475.0%
 
B454.8%
 
T444.7%
 
L424.5%
 
H333.5%
 
R323.4%
 
K303.2%
 
W252.7%
 
V232.5%
 
E212.2%
 
N171.8%
 
Z121.3%
 
I80.9%
 
J80.9%
 
O80.9%
 
Y60.6%
 
U60.6%
 
X40.4%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
a63311.3%
 
e60910.8%
 
o5289.4%
 
r4788.5%
 
i4447.9%
 
n3626.4%
 
l3546.3%
 
t2975.3%
 
u2394.3%
 
s2053.7%
 
g1983.5%
 
m1683.0%
 
d1582.8%
 
h1412.5%
 
c1372.4%
 
p1292.3%
 
y1102.0%
 
k1031.8%
 
b851.5%
 
w691.2%
 
f520.9%
 
z400.7%
 
v340.6%
 
x280.5%
 
q80.1%
 
Other values (2)50.1%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
136100.0%
 

Most frequent Other Symbol characters

ValueCountFrequency (%) 
150.0%
 
150.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.266.7%
 
'133.3%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
21100.0%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-2100.0%
 

Most occurring scripts

Value   Count  Frequency (%)
Latin   6551   97.8%
Common  144    2.2%

Most frequent Latin characters

ValueCountFrequency (%) 
a6339.7%
 
e6099.3%
 
o5288.1%
 
r4787.3%
 
i4446.8%
 
n3625.5%
 
l3545.4%
 
t2974.5%
 
u2393.6%
 
s2053.1%
 
g1983.0%
 
m1682.6%
 
d1582.4%
 
h1412.2%
 
c1372.1%
 
S1292.0%
 
p1292.0%
 
M1201.8%
 
y1101.7%
 
k1031.6%
 
b851.3%
 
w691.1%
 
C620.9%
 
G580.9%
 
P560.9%
 
Other values (28)67910.4%
 

Most frequent Common characters

ValueCountFrequency (%) 
13694.4%
 
.21.4%
 
-21.4%
 
10.7%
 
10.7%
 
'10.7%
 
210.7%
 

Most occurring blocks

Value         Count  Frequency (%)
ASCII         6691   99.9%
Misc Symbols  2      < 0.1%
None          2      < 0.1%

Most frequent ASCII characters

ValueCountFrequency (%) 
a6339.5%
 
e6099.1%
 
o5287.9%
 
r4787.1%
 
i4446.6%
 
n3625.4%
 
l3545.3%
 
t2974.4%
 
u2393.6%
 
s2053.1%
 
g1983.0%
 
m1682.5%
 
d1582.4%
 
h1412.1%
 
c1372.0%
 
1362.0%
 
S1291.9%
 
p1291.9%
 
M1201.8%
 
y1101.6%
 
k1031.5%
 
b851.3%
 
w691.0%
 
C620.9%
 
G580.9%
 
Other values (32)73911.0%
 

Most frequent Misc Symbols characters

ValueCountFrequency (%) 
150.0%
 
150.0%
 

Most frequent None characters

ValueCountFrequency (%) 
é2100.0%
 

Type 1
Categorical

Distinct count: 18
Unique (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 KiB

Value     Count  Frequency (%)
Water     112    14.0%
Normal    98     12.2%
Grass     70     8.8%
Bug       69     8.6%
Psychic   57     7.1%
Fire      52     6.5%
Rock      44     5.5%
Electric  44     5.5%
Ground    32     4.0%
Dragon    32     4.0%
Ghost     32     4.0%
Dark      31     3.9%
Poison    28     3.5%
Fighting  27     3.4%
Steel     27     3.4%
Ice       24     3.0%
Fairy     17     2.1%
Flying    4      0.5%

Length

Max length: 8
Median length: 5
Mean length: 5.26
Min length: 3

Overview of Unicode Properties

Unique unicode characters: 28
Unique unicode categories: 2
Unique unicode scripts: 1
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
r48811.6%
 
a3608.6%
 
o2947.0%
 
e2866.8%
 
c2706.4%
 
s2576.1%
 
i2566.1%
 
t2425.8%
 
l1734.1%
 
g1593.8%
 
G1343.2%
 
n1232.9%
 
h1162.8%
 
W1122.7%
 
u1012.4%
 
F1002.4%
 
N982.3%
 
m982.3%
 
P852.0%
 
y781.9%
 
k751.8%
 
B691.6%
 
D631.5%
 
E441.0%
 
R441.0%
 
Other values (3)832.0%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter340881.0%
 
Uppercase Letter80019.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
G13416.8%
 
W11214.0%
 
F10012.5%
 
N9812.2%
 
P8510.6%
 
B698.6%
 
D637.9%
 
E445.5%
 
R445.5%
 
S273.4%
 
I243.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
r48814.3%
 
a36010.6%
 
o2948.6%
 
e2868.4%
 
c2707.9%
 
s2577.5%
 
i2567.5%
 
t2427.1%
 
l1735.1%
 
g1594.7%
 
n1233.6%
 
h1163.4%
 
u1013.0%
 
m982.9%
 
y782.3%
 
k752.2%
 
d320.9%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin4208100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
r48811.6%
 
a3608.6%
 
o2947.0%
 
e2866.8%
 
c2706.4%
 
s2576.1%
 
i2566.1%
 
t2425.8%
 
l1734.1%
 
g1593.8%
 
G1343.2%
 
n1232.9%
 
h1162.8%
 
W1122.7%
 
u1012.4%
 
F1002.4%
 
N982.3%
 
m982.3%
 
P852.0%
 
y781.9%
 
k751.8%
 
B691.6%
 
D631.5%
 
E441.0%
 
R441.0%
 
Other values (3)832.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII4208100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
r48811.6%
 
a3608.6%
 
o2947.0%
 
e2866.8%
 
c2706.4%
 
s2576.1%
 
i2566.1%
 
t2425.8%
 
l1734.1%
 
g1593.8%
 
G1343.2%
 
n1232.9%
 
h1162.8%
 
W1122.7%
 
u1012.4%
 
F1002.4%
 
N982.3%
 
m982.3%
 
P852.0%
 
y781.9%
 
k751.8%
 
B691.6%
 
D631.5%
 
E441.0%
 
R441.0%
 
Other values (3)832.0%
 

Type 2
Categorical

MISSING

Distinct count: 18
Unique (%): 4.3%
Missing: 386
Missing (%): 48.3%
Memory size: 6.4 KiB

Value      Count  Frequency (%)
Flying     97     12.1%
Ground     35     4.4%
Poison     34     4.2%
Psychic    33     4.1%
Fighting   26     3.2%
Grass      25     3.1%
Fairy      23     2.9%
Steel      22     2.8%
Dark       20     2.5%
Dragon     18     2.2%
Water      14     1.8%
Ice        14     1.8%
Ghost      14     1.8%
Rock       14     1.8%
Fire       12     1.5%
Electric   6      0.8%
Normal     4      0.5%
Bug        3      0.4%
(Missing)  386    48.2%

Length

Max length: 8
Median length: 3
Mean length: 4.3725
Min length: 3

Overview of Unicode Properties

Unique unicode characters: 28
Unique unicode categories: 2
Unique unicode scripts: 1
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
n98228.1%
 
a49014.0%
 
i2577.3%
 
g1704.9%
 
F1584.5%
 
r1574.5%
 
o1534.4%
 
y1534.4%
 
s1313.7%
 
l1293.7%
 
c1063.0%
 
e902.6%
 
t822.3%
 
G742.1%
 
h732.1%
 
P671.9%
 
D381.1%
 
u381.1%
 
d351.0%
 
k341.0%
 
S220.6%
 
I140.4%
 
R140.4%
 
W140.4%
 
E60.2%
 
Other values (3)110.3%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter308488.2%
 
Uppercase Letter41411.8%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
F15838.2%
 
G7417.9%
 
P6716.2%
 
D389.2%
 
S225.3%
 
I143.4%
 
R143.4%
 
W143.4%
 
E61.4%
 
N41.0%
 
B30.7%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n98231.8%
 
a49015.9%
 
i2578.3%
 
g1705.5%
 
r1575.1%
 
o1535.0%
 
y1535.0%
 
s1314.2%
 
l1294.2%
 
c1063.4%
 
e902.9%
 
t822.7%
 
h732.4%
 
u381.2%
 
d351.1%
 
k341.1%
 
m40.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin3498100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n98228.1%
 
a49014.0%
 
i2577.3%
 
g1704.9%
 
F1584.5%
 
r1574.5%
 
o1534.4%
 
y1534.4%
 
s1313.7%
 
l1293.7%
 
c1063.0%
 
e902.6%
 
t822.3%
 
G742.1%
 
h732.1%
 
P671.9%
 
D381.1%
 
u381.1%
 
d351.0%
 
k341.0%
 
S220.6%
 
I140.4%
 
R140.4%
 
W140.4%
 
E60.2%
 
Other values (3)110.3%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII3498100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
n98228.1%
 
a49014.0%
 
i2577.3%
 
g1704.9%
 
F1584.5%
 
r1574.5%
 
o1534.4%
 
y1534.4%
 
s1313.7%
 
l1293.7%
 
c1063.0%
 
e902.6%
 
t822.3%
 
G742.1%
 
h732.1%
 
P671.9%
 
D381.1%
 
u381.1%
 
d351.0%
 
k341.0%
 
S220.6%
 
I140.4%
 
R140.4%
 
W140.4%
 
E60.2%
 
Other values (3)110.3%
 

HP
Real number (ℝ≥0)

Distinct count: 94
Unique (%): 11.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 69.25875
Minimum: 1
Maximum: 255
Zeros: 0
Zeros (%): 0.0%
Memory size: 6.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 35.95
Q1: 50
median: 65
Q3: 80
95-th percentile: 110
Maximum: 255
Range: 254
Interquartile range (IQR): 30

Descriptive statistics

Standard deviation: 25.53466903
Coefficient of variation (CV): 0.368685098
Kurtosis: 7.232078374
Mean: 69.25875
Median Absolute Deviation (MAD): 15
Skewness: 1.568224376
Sum: 55407
Variance: 652.0193226
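
Several of the descriptive statistics above are related by simple identities: the variance is the square of the standard deviation, the coefficient of variation is the standard deviation divided by the mean, and the sum is the mean times the number of observations. The sketch below illustrates this with Python's `statistics` module on the ten HP values from the first rows of the sample, then checks the reported HP figures against the same identities. It assumes the sample (n - 1) standard deviation; the helper name `describe` is illustrative and not part of pandas-profiling.

```python
import math
import statistics


def describe(values):
    """Return a few descriptive statistics for a list of numbers."""
    mean = statistics.mean(values)
    std = statistics.stdev(values)  # assumption: sample standard deviation (n - 1)
    return {
        "sum": sum(values),
        "mean": mean,
        "std": std,
        "variance": std ** 2,
        "cv": std / mean,
    }


# HP of the first ten rows shown in the sample section of this report.
hp_sample = [45, 60, 80, 80, 39, 58, 78, 78, 78, 44]
summary = describe(hp_sample)
print(summary)

# Consistency checks on the figures reported for HP (800 observations).
reported = {
    "n": 800,
    "sum": 55407,
    "mean": 69.25875,
    "std": 25.53466903,
    "variance": 652.0193226,
    "cv": 0.368685098,
}
cv_ok = math.isclose(reported["std"] / reported["mean"], reported["cv"], rel_tol=1e-6)
var_ok = math.isclose(reported["std"] ** 2, reported["variance"], rel_tol=1e-6)
sum_ok = math.isclose(reported["mean"] * reported["n"], reported["sum"], rel_tol=1e-9)
print(cv_ok, var_ok, sum_ok)
```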
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
60678.4%
 
50637.9%
 
70577.1%
 
65465.8%
 
75435.4%
 
80435.4%
 
40384.8%
 
45384.8%
 
55374.6%
 
100324.0%
 
90273.4%
 
95222.8%
 
85192.4%
 
35151.9%
 
30131.6%
 
105101.2%
 
7891.1%
 
11091.1%
 
4470.9%
 
7970.9%
 
6270.9%
 
6870.9%
 
9170.9%
 
3860.8%
 
5960.8%
 
Other values (69)16520.6%
 
ValueCountFrequency (%) 
110.1%
 
1010.1%
 
2060.8%
 
2520.2%
 
2810.1%
 
30131.6%
 
3110.1%
 
35151.9%
 
3610.1%
 
3710.1%
 
ValueCountFrequency (%) 
25510.1%
 
25010.1%
 
19010.1%
 
17010.1%
 
16510.1%
 
16010.1%
 
15040.5%
 
14410.1%
 
14010.1%
 
13510.1%
 

Attack
Real number (ℝ≥0)

Distinct count: 111
Unique (%): 13.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 79.00125
Minimum: 5
Maximum: 190
Zeros: 0
Zeros (%): 0.0%
Memory size: 6.4 KiB

Quantile statistics

Minimum: 5
5-th percentile: 30
Q1: 55
median: 75
Q3: 100
95-th percentile: 136.2
Maximum: 190
Range: 185
Interquartile range (IQR): 45

Descriptive statistics

Standard deviation: 32.45736587
Coefficient of variation (CV): 0.4108462318
Kurtosis: 0.1697173149
Mean: 79.00125
Median Absolute Deviation (MAD): 20
Skewness: 0.551613748
Sum: 63201
Variance: 1053.480599
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
100405.0%
 
65394.9%
 
80374.6%
 
50374.6%
 
85334.1%
 
60334.1%
 
75324.0%
 
70313.9%
 
90303.8%
 
55303.8%
 
45273.4%
 
95263.2%
 
120212.6%
 
40212.6%
 
30202.5%
 
110182.2%
 
105172.1%
 
130141.8%
 
125141.8%
 
35131.6%
 
150111.4%
 
11591.1%
 
4891.1%
 
2081.0%
 
9270.9%
 
Other values (86)22327.9%
 
ValueCountFrequency (%) 
520.2%
 
1030.4%
 
1510.1%
 
2081.0%
 
2210.1%
 
2310.1%
 
2410.1%
 
2570.9%
 
2710.1%
 
2910.1%
 
ValueCountFrequency (%) 
19010.1%
 
18510.1%
 
18030.4%
 
17020.2%
 
16530.4%
 
16410.1%
 
16050.6%
 
15520.2%
 
150111.4%
 
14710.1%
 

Defense
Real number (ℝ≥0)

Distinct count: 103
Unique (%): 12.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 73.8425
Minimum: 5
Maximum: 230
Zeros: 0
Zeros (%): 0.0%
Memory size: 6.4 KiB

Quantile statistics

Minimum: 5
5-th percentile: 35
Q1: 50
median: 70
Q3: 90
95-th percentile: 130
Maximum: 230
Range: 225
Interquartile range (IQR): 40

Descriptive statistics

Standard deviation: 31.18350056
Coefficient of variation (CV): 0.422297465
Kurtosis: 2.72626036
Mean: 73.8425
Median Absolute Deviation (MAD): 20
Skewness: 1.155912303
Sum: 59074
Variance: 972.4107071
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
70546.8%
 
50496.1%
 
60465.8%
 
80394.9%
 
40364.5%
 
65364.5%
 
90354.4%
 
100334.1%
 
55324.0%
 
45324.0%
 
85293.6%
 
75263.2%
 
95263.2%
 
35232.9%
 
105151.9%
 
30141.8%
 
120131.6%
 
115111.4%
 
110111.4%
 
48111.4%
 
130101.2%
 
6370.9%
 
15070.9%
 
6270.9%
 
6770.9%
 
Other values (78)19123.9%
 
ValueCountFrequency (%) 
520.2%
 
1010.1%
 
1540.5%
 
2040.5%
 
2310.1%
 
2520.2%
 
2810.1%
 
30141.8%
 
3220.2%
 
3310.1%
 
ValueCountFrequency (%) 
23030.4%
 
20020.2%
 
18410.1%
 
18030.4%
 
16810.1%
 
16030.4%
 
15070.9%
 
14520.2%
 
14060.8%
 
13520.2%
 

Sp. Atk
Real number (ℝ≥0)

Distinct count: 105
Unique (%): 13.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 72.82
Minimum: 10
Maximum: 194
Zeros: 0
Zeros (%): 0.0%
Memory size: 6.4 KiB

Quantile statistics

Minimum: 10
5-th percentile: 30
Q1: 49.75
median: 65
Q3: 95
95-th percentile: 131.05
Maximum: 194
Range: 184
Interquartile range (IQR): 45.25

Descriptive statistics

Standard deviation: 32.72229417
Coefficient of variation (CV): 0.4493586126
Kurtosis: 0.2978936607
Mean: 72.82
Median Absolute Deviation (MAD): 20
Skewness: 0.7446624978
Sum: 58256
Variance: 1070.748536
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
60516.4%
 
40496.1%
 
65445.5%
 
50394.9%
 
55354.4%
 
45334.1%
 
70303.8%
 
35293.6%
 
85273.4%
 
80273.4%
 
95273.4%
 
100273.4%
 
30243.0%
 
90212.6%
 
105202.5%
 
75182.2%
 
110162.0%
 
120141.8%
 
125131.6%
 
130111.4%
 
25111.4%
 
11591.1%
 
15091.1%
 
8381.0%
 
4481.0%
 
Other values (80)20025.0%
 
ValueCountFrequency (%) 
1030.4%
 
1540.5%
 
2081.0%
 
2310.1%
 
2420.2%
 
25111.4%
 
2720.2%
 
2910.1%
 
30243.0%
 
3110.1%
 
ValueCountFrequency (%) 
19410.1%
 
18030.4%
 
17510.1%
 
17030.4%
 
16520.2%
 
16020.2%
 
15910.1%
 
15420.2%
 
15091.1%
 
14540.5%
 

Sp. Def
Real number (ℝ≥0)

Distinct count: 92
Unique (%): 11.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 71.9025
Minimum: 20
Maximum: 230
Zeros: 0
Zeros (%): 0.0%
Memory size: 6.4 KiB

Quantile statistics

Minimum: 20
5-th percentile: 32.95
Q1: 50
median: 70
Q3: 90
95-th percentile: 120
Maximum: 230
Range: 210
Interquartile range (IQR): 40

Descriptive statistics

Standard deviation: 27.8289158
Coefficient of variation (CV): 0.3870368318
Kurtosis: 1.628394057
Mean: 71.9025
Median Absolute Deviation (MAD): 20
Skewness: 0.8540186115
Sum: 57522
Variance: 774.4485544
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
80526.5%
 
50506.2%
 
55475.9%
 
65445.5%
 
60435.4%
 
75405.0%
 
70405.0%
 
90364.5%
 
45354.4%
 
85303.8%
 
40303.8%
 
95293.6%
 
100283.5%
 
30202.5%
 
35182.2%
 
105172.1%
 
110151.9%
 
120131.6%
 
25111.4%
 
63101.2%
 
115101.2%
 
4891.1%
 
13091.1%
 
15070.9%
 
5670.9%
 
Other values (67)15018.8%
 
ValueCountFrequency (%) 
2060.8%
 
2310.1%
 
25111.4%
 
30202.5%
 
3110.1%
 
3210.1%
 
3310.1%
 
3410.1%
 
35182.2%
 
3610.1%
 
ValueCountFrequency (%) 
23010.1%
 
20010.1%
 
16020.2%
 
15430.4%
 
15070.9%
 
14020.2%
 
13810.1%
 
13540.5%
 
13091.1%
 
12910.1%
 

Speed
Real number (ℝ≥0)

Distinct count: 108
Unique (%): 13.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 68.2775
Minimum: 5
Maximum: 180
Zeros: 0
Zeros (%): 0.0%
Memory size: 6.4 KiB

Quantile statistics

Minimum: 5
5-th percentile: 25
Q1: 45
median: 65
Q3: 90
95-th percentile: 115
Maximum: 180
Range: 175
Interquartile range (IQR): 45

Descriptive statistics

Standard deviation: 29.06047372
Coefficient of variation (CV): 0.4256229903
Kurtosis: -0.2364366728
Mean: 68.2775
Median Absolute Deviation (MAD): 21
Skewness: 0.3579332951
Sum: 54622
Variance: 844.5111327
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
50465.8%
 
60445.5%
 
70374.6%
 
65364.5%
 
30354.4%
 
80334.1%
 
40324.0%
 
90313.9%
 
100313.9%
 
55303.8%
 
45293.6%
 
85273.4%
 
95273.4%
 
35222.8%
 
75162.0%
 
110151.9%
 
20151.9%
 
105121.5%
 
115111.4%
 
25101.2%
 
1591.1%
 
5881.0%
 
10870.9%
 
10170.9%
 
6860.8%
 
Other values (83)22428.0%
 
ValueCountFrequency (%) 
520.2%
 
1030.4%
 
1591.1%
 
20151.9%
 
2210.1%
 
2340.5%
 
2410.1%
 
25101.2%
 
2840.5%
 
2930.4%
 
ValueCountFrequency (%) 
18010.1%
 
16010.1%
 
15040.5%
 
14530.4%
 
14020.2%
 
13520.2%
 
13060.8%
 
12810.1%
 
12710.1%
 
12610.1%
 

Generation
Real number (ℝ≥0)

Distinct count: 6
Unique (%): 0.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.32375
Minimum: 1
Maximum: 6
Zeros: 0
Zeros (%): 0.0%
Memory size: 6.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2
median: 3
Q3: 5
95-th percentile: 6
Maximum: 6
Range: 5
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.6612904
Coefficient of variation (CV): 0.4998241145
Kurtosis: -1.239575758
Mean: 3.32375
Median Absolute Deviation (MAD): 2
Skewness: 0.01425810028
Sum: 2659
Variance: 2.759885795
Histogram with fixed size bins (bins=10)
Value  Count  Frequency (%)
1      166    20.8%
5      165    20.6%
3      160    20.0%
4      121    15.1%
2      106    13.2%
6      82     10.2%

Value  Count  Frequency (%)
1      166    20.8%
2      106    13.2%
3      160    20.0%
4      121    15.1%
5      165    20.6%
6      82     10.2%

Value  Count  Frequency (%)
6      82     10.2%
5      165    20.6%
4      121    15.1%
3      160    20.0%
2      106    13.2%
1      166    20.8%

Legendary
Boolean

Distinct count: 2
Unique (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 928.0 B

Value  Count  Frequency (%)
False  735    91.9%
True   65     8.1%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
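
The calculation just described can be sketched in plain Python as follows. The function name `pearson_r` is illustrative, and the example data are the HP and Attack values of the first ten rows in the sample section; this is a sketch of the textbook formula, not the report's own code.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator,
    # so plain sums of deviations are enough.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / math.sqrt(ss_x * ss_y)


hp = [45, 60, 80, 80, 39, 58, 78, 78, 78, 44]
attack = [49, 62, 82, 100, 52, 64, 84, 130, 104, 48]
r = pearson_r(hp, attack)

# Changing location and scale of one variable leaves r unchanged.
r_rescaled = pearson_r([10 * h + 5 for h in hp], attack)
print(r, r_rescaled)
```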

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
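
A minimal sketch of this rank-based calculation in plain Python is shown below. It assumes tied values receive the average of their ranks, which is a common convention but is not stated in the text above; the function names are illustrative.

```python
import math


def average_ranks(values):
    """Return 1-based ranks, giving tied values the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return cov / math.sqrt(ss_x * ss_y)


def spearman_rho(x, y):
    """Pearson's r computed on the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))


# A monotonic but nonlinear relationship: rho is 1 while r is below 1.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y), pearson_r(x, y))
```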

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
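
The pair-counting definition above corresponds to the variant usually called tau-a. A minimal sketch in plain Python follows; the function name `kendall_tau_a` is illustrative. Library implementations often default to a tie-adjusted variant, so results may differ when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = 0
    discordant = 0
    for i, j in combinations(range(n), 2):
        product = (x[i] - x[j]) * (y[i] - y[j])
        if product > 0:
            concordant += 1  # both variables move in the same direction
        elif product < 0:
            discordant += 1  # the variables move in opposite directions
        # product == 0 is a tie in x or y: counted as neither
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Two concordant pairs and one discordant pair out of three.
tau = kendall_tau_a([1, 2, 3], [1, 3, 2])
print(tau)
```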

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
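
The following is a sketch of a bias-corrected Cramér's V in plain Python. It assumes the correction takes the commonly cited form attributed to Bergsma (2013): φ² = χ²/n is reduced by (k-1)(r-1)/(n-1) and floored at zero, and the row and column counts r and k are shrunk by (r-1)²/(n-1) and (k-1)²/(n-1) respectively. The function name is illustrative, and the report's own implementation may differ in detail.

```python
import math
from collections import Counter


def cramers_v_corrected(x, y):
    """Bias-corrected Cramér's V for two equal-length sequences of labels."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    joint = Counter(zip(x, y))
    row_totals = Counter(x)
    col_totals = Counter(y)

    # Pearson chi-squared statistic of the contingency table.
    chi2 = 0.0
    for a, row_total in row_totals.items():
        for b, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((a, b), 0)
            chi2 += (observed - expected) ** 2 / expected

    r, k = len(row_totals), len(col_totals)
    phi2 = chi2 / n
    # Assumed form of the bias correction (see the lead-in above).
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(r_corr - 1, k_corr - 1)
    if denom <= 0:
        return 0.0  # a variable with a single level carries no association
    return math.sqrt(phi2_corr / denom)


# Perfect association: each label of x always pairs with the same label of y.
v_perfect = cramers_v_corrected(["Fire", "Water"] * 50, [True, False] * 50)
# Independence: every combination occurs equally often.
v_independent = cramers_v_corrected(["a", "a", "b", "b"] * 25, ["c", "d", "c", "d"] * 25)
print(v_perfect, v_independent)
```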

Missing values

Sample

First rows

   Name              Type 1  Type 2  HP  Attack  Defense  Sp. Atk  Sp. Def  Speed  Generation  Legendary
0  Bulbasaur         Grass   Poison  45  49      49       65       65       45     1           False
1  Ivysaur           Grass   Poison  60  62      63       80       80       60     1           False
2  Venusaur          Grass   Poison  80  82      83       100      100      80     1           False
3  Mega Venusaur     Grass   Poison  80  100     123      122      120      80     1           False
4  Charmander        Fire    NaN     39  52      43       60       50       65     1           False
5  Charmeleon        Fire    NaN     58  64      58       80       65       80     1           False
6  Charizard         Fire    Flying  78  84      78       109      85       100    1           False
7  Mega Charizard X  Fire    Dragon  78  130     111      130      85       100    1           False
8  Mega Charizard Y  Fire    Flying  78  104     78       159      115      100    1           False
9  Squirtle          Water   NaN     44  48      65       50       64       43     1           False

Last rows

     Name                Type 1   Type 2  HP   Attack  Defense  Sp. Atk  Sp. Def  Speed  Generation  Legendary
790  Noibat              Flying   Dragon  40   30      35       45       40       55     6           False
791  Noivern             Flying   Dragon  85   70      80       97       80       123    6           False
792  Xerneas             Fairy    NaN     126  131     95       131      98       99     6           True
793  Yveltal             Dark     Flying  126  131     95       131      98       99     6           True
794  Zygarde Half Forme  Dragon   Ground  108  100     121      81       95       95     6           True
795  Diancie             Rock     Fairy   50   100     150      100      150      50     6           True
796  Mega Diancie        Rock     Fairy   50   160     110      160      110      110    6           True
797  Hoopa Confined      Psychic  Ghost   80   110     60       150      130      70     6           True
798  Hoopa Unbound       Psychic  Dark    80   160     60       170      130      80     6           True
799  Volcanion           Fire     Water   80   110     120      130      90       70     6           True